# Thinking mode switching
Qwen3 0.6B Llamafile
Apache-2.0
Qwen3 is the latest generation of large language models in the Qwen series, offering a dense model with 0.6B parameters, achieving breakthrough progress in reasoning, instruction following, agent capabilities, and multilingual support.
Large Language Model
Q
Mozilla
250
1
Qwen3 8B AWQ
Apache-2.0
Qwen3-8B-AWQ is the latest generation of large language model with 8.2B parameters in the Tongyi Qianwen series, which uses AWQ 4-bit quantization technology to optimize inference efficiency. It supports the switching between thinking and non-thinking modes and has excellent reasoning, instruction-following, and intelligent agent capabilities.
Large Language Model
Transformers

Q
Qwen
13.99k
2
Qwen3 30B A3B AWQ
Apache-2.0
Qwen3-30B-A3B-AWQ is an AWQ quantized version based on the Qwen3-30B-A3B model, suitable for text generation tasks and supporting the switching between thinking mode and non-thinking mode.
Large Language Model
Transformers

Q
cognitivecomputations
14.45k
12
Qwen3 235B A22B INT4MIX
Apache-2.0
Qwen3-235B-A22B is the latest generation of Tongyi's large model series, offering a range of dense and mixture-of-experts (MoE) models. It has made breakthroughs in inference, instruction following, intelligent agent capabilities, and multilingual support.
Large Language Model
Transformers

Q
fastllm
144
2
Qwen3 8B GPTQ Int4
Apache-2.0
Qwen3-4B is the latest large language model in the Qwen series, featuring the ability to switch thinking modes, powerful reasoning capabilities, excellent human preference alignment, outstanding agent capabilities, and multilingual support.
Large Language Model
Transformers

Q
JunHowie
2,365
2
Qwen3 32B GPTQ Int8
Apache-2.0
Qwen3-8B is a large language model in the Qwen3 series. It has the characteristics of a causal language model and performs excellently in reasoning, multilingual support, agent capabilities, etc., bringing users a natural and smooth conversation experience.
Large Language Model
Transformers

Q
JunHowie
2,070
3
Qwen3 32B 128K GGUF
Apache-2.0
Qwen3 is the latest generation of large language models in the Qwen series, offering a range of dense and mixture-of-experts (MoE) models. Based on extensive training, Qwen3 has made breakthroughs in reasoning, instruction following, agent capabilities, and multilingual support.
Large Language Model English
Q
unsloth
20.51k
20
Qwen3 8B GGUF
Apache-2.0
Qwen3 is the latest generation of large language models in the Qwen series, offering a range of dense and mixture-of-experts (MoE) models. Based on extensive training, Qwen3 has made breakthroughs in reasoning, instruction following, agent capabilities, and multilingual support.
Large Language Model English
Q
unsloth
64.32k
39
Qwen3 30B A3B GGUF
Apache-2.0
Qwen3 is the latest large language model series developed by Alibaba Cloud, supporting dynamic switching between thinking mode and non-thinking mode, and excelling in reasoning, multilingual support, and intelligent agent capabilities.
Large Language Model English
Q
unsloth
261.09k
169
Qwen3 32B Unsloth Bnb 4bit
Apache-2.0
Qwen3 is the latest version of the Tongyi Qianwen series of large language models, offering a dense model with 32.8B parameters, achieving breakthrough progress in reasoning capabilities, instruction following, agent functionality, and multilingual support.
Large Language Model
Transformers English

Q
unsloth
10.03k
5
Qwen3 30B A3B
Apache-2.0
Qwen3 is the latest version of the Tongyi Qianwen series of large language models, offering a complete combination of dense models and Mixture of Experts (MoE) models. Based on large-scale training, Qwen3 has achieved breakthrough progress in inference ability, instruction following, intelligent agent functions, and multilingual support.
Large Language Model
Transformers

Q
Qwen
218.81k
571
Qwen3 14B
Apache-2.0
Qwen3-14B is the latest large language model in the Tongyi Qianwen series, with 14.8 billion parameters. It supports the switching between thinking and non-thinking modes and performs excellently in reasoning, instruction following, and intelligent agent capabilities.
Large Language Model
Transformers

Q
Qwen
297.02k
152
Qwen3 0.6B
Apache-2.0
Qwen3-0.6B is the latest generation of large language model with a parameter scale of 0.6B in the Tongyi Qianwen series. It supports the switching between thinking and non-thinking modes and has powerful reasoning, instruction-following, and intelligent agent capabilities.
Large Language Model
Transformers

Q
Qwen
497.09k
264
Featured Recommended AI Models